Investigating the potential of 6-substituted 3-formyl chromone derivatives as anti-diabetic agents using in silico methods

In exploring nature's potential in addressing diabetes-related conditions, this study investigates the therapeutic capabilities of 3-formyl chromone derivatives. Utilizing in silico methodologies, we focus on 6-substituted 3-formyl chromone derivatives (1–16) to assess their therapeutic potential in treating diabetes. The research examined the formyl group at the chromone’s C-3 position. ADMET, biological activities, were conducted along with B3LYP calculations using 3 different basis sets. The analogues were analyzed based on their parent structure obtained from PubChem. The HOMO–LUMO gap confirmed the bioactive nature of the derivatives, NBO analysis was performed to understand the charge transfer. PASS prediction revealed that 3-formyl chromone derivatives are potent aldehyde oxidase inhibitors, insulin inhibitors, HIF1A expression inhibitors, and histidine kinase. Molecular docking studies indicated that the compounds had a strong binding affinity with proteins, including CAD, BHK, IDE, HIF-α, p53, COX, and Mpro of SARS-CoV2. 6-isopropyl-3-formyl chromone (4) displayed the highest affinity for IDE, with a binding energy of − 8.5 kcal mol−1. This result outperformed the affinity of the reference standard dapagliflozin (− 7.9 kcal mol−1) as well as two other compounds that target human IDE, namely vitexin (− 8.3 kcal mol−1) and myricetin (− 8.4 kcal mol−1). MD simulations were revealed RMSD value between 0.2 and 0.5 nm, indicating the strength of the protein–ligand complex at the active site.

www.nature.com/scientificreports/(1-16) were optimized using the B3LYP/6-31G(d,p), B3LYP/6-311G(d,p), and B3LYP/6-311 + + G(d,p) levels of theory.Frequencies were obtained to confirm that the minima had no imaginary frequencies.Experimental data was used to compare the theoretical values and identify the most reliable methods for calculating structural data.Previous research demonstrated 33 that B3LYP/6-31G(d,p) provided acceptable and computationally efficient results, while G3MP2, G3MP2B3, G4MP2, G3B3, and CBS-QB3 agree within 10 kJ mol −1 with each other, and within 18 kJ mol −1 with the results of B3LYP/6-31G(d,p).Energy difference from highest occupied molecular orbital (HOMO) to lowest unoccupied molecular orbital (LUMO) and the molecular electrostatic potential (MEP) map were calculated using GaussView 6.The HOMO, LUMO, and MEP maps can be found in Tables S1 to S19 and Figs.S1 to S25 in the supporting information (SI).Unless otherwise specified, all values in the text are for B3LYP/6-31G (d,p).The importance of the Frontier Molecular Orbital (FMO) in maintaining structural stability was demonstrated through the analysis of the HOMO and LUMO 34 .In addition, several chemical descriptors, including Energy Gap (E Gap ), Ionization Potential (IP), Electron Affinity (EA), Electronegativity (χ), Chemical Potential (μ), Hardness (η), Softness (σ), and Electrophilicity (ω), were calculated for compounds (1-16) using the equations below 35,36 : where μ and η were used to derive the electrophilicity indexes.Mullikan population analysis was employed to calculate the total atomic charges, while NBO analysis was utilized to determine the charges of each individual atom.To obtain atomic charges, single point energy and NBO calculations were conducted 37 .

Evaluation of physicochemical and pharmacokinetic properties
The physiochemical properties and effectiveness of the 3-formyl chromone derivatives (1-16) were evaluated using different tools.The swissADME server (www.swiss adme.ch) was used to obtain physicochemical properties, medicinal chemistry, and drug-likeness data compiled from the PubChem database (https:// pubch em.ncbi.nlm.nih.gov/ search/ search.cgi) 38 .In contrast, the AdmetSAR tool (http:// lmmd.ecust.edu.cn/ admet sar2/) and ADMET Predictor software version 9.5 were used to determine and analyze the ADMET properties.Canonical SMILES of each compound were used to obtain several toxicity values, CYP inhibitors, and hERG pIC50 scores.AdmetSAR provided both unregulated and downregulated values for each ADMET property.The Molinspiration online open-access server (https:// www.molin spira tion.com/ cgi-bin/ prope rties) 39 was used to evaluate the relationship between the compounds' physicochemical properties and molecular activity.The drug-likeness of the compounds was investigated as G-protein coupled receptor (GPCR) ligands, ion channel modulators (ICM), www.nature.com/scientificreports/kinase inhibitors (KI), nuclear receptor ligands (NRL), protease inhibitors (PI), and enzyme inhibitors (EI).The structures (1 − 16) were drawn in ChemBioDraw Ultra 14.0 to collect MDL Molfile format and then optimized structures converted to the canonical simplified molecular-input system (SMILES) before analysis.

Pharmacological activities
To determine the potential pharmacological activities of the compounds, the researchers employed the Prediction of Activity Spectra for Substances (PASS) tool, which can provide information on toxicity, mutagenicity, receptor regulation, mechanism, carcinogenicity, teratogenicity, and other pharmacological effects of a compound 40 .Specifically, they used the Pass Online server tool (http:// way2d rug.com/ passo nline), which offers access to biochemical information on compounds approved for medicinal use by the USA and the Russian Federation 41 .

Molecular docking
Preparation of ligands Their SDF files were retrieved from the PubChem database to obtain 3D structures for compounds 1-16, vitexin, myricetin, and the reference drug dapagliflozin.The GaussView 6 was employed to draw all the flavonoid ligands, and their structures were optimized at the B3LYP/6-31G (d,p) level of theory using Gaussian 16.Following that, the chemical structures of the selected ligands underwent energy minimization.Lastly, the structures were converted into the PDBQT format using the OpenBabel plugin of PyRx 0.8 software, which can be accessed at https:// pyrx.sourc eforge.io/ 42 .

Preparation of target protein
To perform molecular docking, molecular docking software selected appropriate proteins with potential binding sites to the ligands.The crystal structures of the required proteins were retrieved from the RSCB Protein Data Bank tool (https:// www.rcsb.org/) 43 , the largest archive of over 505,000,000 atomic coordinate and experimental data files 44 .For this study, a total of seven proteins were selected, including Bacterial Histidine Kinase (PDB ID: 3DGE), Insulin Degrading Enzyme (PDB ID: 6BF8), p53 protein (PDB ID: 7EAX), Alcohol Dehydrogenase of H.pylori (PDB ID: 3TWO), Hypoxia Inducible Factor 1 subunit alpha (PDB ID: 2WA4), Cyclooxygenase (PDB ID: 6Y3C), and SARS-CoV main protease (PDB ID: 6LU7).We were chosen the PDB IDs of our targeted proteins from different literatures.The protein structures were optimized using Chimera v1.16 45 , which involved removing water molecules and ligands and energy minimization.The default AMBER ff14SB was used for all the ligands and proteins.

Protein-ligand docking
AutoDock Vina 46 was used to determine the protein-ligand binding regions and identify the amino acid interactions, which served as the target protein for the ligands (1-16) during molecular docking.A grid box was selected to cover the entire protein, and the docking experiment was conducted with stable proteins and ligands.The protein-ligand binding was validated by re-docking, and the binding modes of the receptor-ligand interactions were visualized using version 1.16 of UCSF Chimera 44 , PyMol version 2.5 47 , and BIOVIA Discovery Studio visualizer 48 .UCSF Chimera identified the amino acids interacting with the ligands 44 , while PyMol generated 3D structures for the molecular docking images.BIOVIA Discovery Studio was used to identify the 2D protein-ligand interactions and determine the hydrogen density around the interacting residues with the protein.

Molecular dynamics simulations
To investigate the behavior of complex systems and gain more insight than traditional experimental methods, we conducted MD simulations using the GROMACS version 2021.6 48 with the AMBER99SB force field 49 , which provides a detailed description of atom interactions 50 .MD simulations were used to explore the docked complexes of PDB: 6BF8 and 6Y3C with compound 4. The topology parameters for the proteins were generated using the Galaxy European Server 51 , a popular platform for molecular modeling and simulation.To create a standard salt concentration and neutralize the system, the system was solvated with SPC water molecules in a triclinic box, and Na + ions were added 52 .The equilibration of the system was achieved through a position-restrained dynamic simulation (NVT) using the leap-frog algorithm at 300 K for 3000ps 53 .This ensured that the pressure and temperature remained constant and that the system reached a stable state.Subsequently, a production run was carried out for another 3000 ps with the same temperature and pressure.Through MD simulations, the current study provided insights into the behavior of the docked protein-ligand complexes.We analyzed MD trajectories using VMD 54 , PyMol 47 , and GROMACS programs.RMSD, Rg, and RMSF were calculated using GROMACS utility gmx rmsd, gmx gyrate, and gmx rmsf for all MD simulations performed at 298 K for 20 ns.Hydrogen bond analysis was performed using the GROMACS utility 'gmx hbond, ' and temperature and potential energy were also determined.

Vibrational spectra
The FTIR spectra of 3-formylchromone (1) showed that the characteristic IR bands calculated at B3LYP/6-31G(d,p) and corresponding assignment are presented in Fig. S1 and Table S21 in the SI, which is in good agreement with experimental results 71 .In general, the bands around 1665-1458 cm −1 at B3LYP/6-31G(d,p) are closer to experimental values 71 (around ~ 1652 to ~ 1464 cm −1 in the benzene ring are assigned to the skeletal C-C stretching modes.The calculated splitting was observed in the C-C stretches from C-CHO for 3-formylchromone has strong and medium bands, which were exhibited as ~ 1380 to ~ 1329 cm −1 , which is closer to experimental values (~ 1360 to ~ 1340 cm −1 .In the spectrum of ligand 1, the strong C=O stretching band in chromone appeared in the range 1690-1630 cm −1 , and calculated splitting was observed (Fig. S1 and Table S21 in the SI).Similarly, the experimental splitting was observed in the C = O stretches in the same regions.The 1 ligand by both antisymmetric and symmetric vibrations for the C-H stretching modes was observed at ~ 3005 to ~ 2905 cm −1 , which is closer to experimental values (~ 2972 to ~ 2900 cm −1 ).There is good agreement between the experimental X-ray data 71 and theoretical data (Fig. S1 of the SI).

Analysis of Frontier molecular orbitals
Investigating the FMO in the 3-formyl chromone derivatives (1-16) offers vital insights into their stability and reactivity 64 .The energy gap (E gap ) between the HOMO and LUMO known as the HOMO-LUMO energy gap, is a critical factor that determines their chemical reactivity, hardness, softness, chemical potential, and electrophilic index.A narrow E gap indicates softness, signifying high reactivity but low stability, while a wide Egap indicates high stability and low reactivity.The energy levels of the HOMO and LUMO orbitals help in understanding the compounds electron-donating and accepting properties, respectively.Our study presents the findings of FMO analysis, which are summarized in Table 2 and illustrated in Fig. 2 (see Figs. S2 to S19 in the Supplementary Information), providing a comprehensive understanding of the FMOs of the compounds.We have calculated the DFT calculations at 298.15 K.This temperature is close to room temperature and represents conditions relevant  to biological systems.Many biological processes, including those targeted by potential drugs, occur at or near body temperature.By performing calculations at 298.15 K, the results become more relevant to understanding how the molecules might behave within a living organism.
The FMO analysis of 3-formyl chromone derivatives (1-16) reveals essential insights into their stability and reactivity.Among the 6-substituted derivatives (9, 10, and 11), the HOMO-LUMO gap decreases in the order F > Cl > Br, ranging from 4.557 eV to 4.542 eV (Table 2).Soft ligands (Y = Br) exhibit a smaller HOMO-LUMO gap due to the greater polarizability of their valence electrons compared to fluorine (Table 2).This enhanced polarizability contributes to the electron cloud distortion in C6-halide substituted 3-formyl chromone derivatives.The soft base (Br), characterized by high polarizability and low electronegativity, and further decreases the HOMO-LUMO gap in C6-X substituted chromone rings.Conversely, the hard base (F) with higher electron density exhibits a larger HOMO-LUMO gap.A smaller gap indicates softness, associated with high reactivity and low stability, while a wider gap implies high stability and low reactivity.The energy levels of the HOMO and LUMO orbitals provide insights into the compounds' electron acceptor and donor properties, respectively.
For the 6-hydroxy substituted chromone derivatives, the HOMO-LUMO gap of compound 5 (6-hydroxy) is 4.573 eV, lower than that of compound 4 (6-OCH 2 CH 3 , 4.505 eV) and compound 3 (6-NH 2 , 4.233 eV), due to the electronic effects of the electron-rich chromone ring in 3-formyl chromone derivatives.Additionally, the presence of C6 alkyl substituents (-CH 3 , -CH 2 CH 3 , -CH (CH 3 ) 2 ) leads to a decreasing trend in the HOMO-LUMO gap, likely due to steric effects (Table 2).Compounds 2-16 exhibit higher energy gaps than the parent compound 1, indicating changes caused by EDG and EWG at the 6-substituted position.Vitexin and myricetin display lower energy gaps than the other compounds (2-16).The reference drug, dapagliflozin (D), has a higher energy gap than the other compounds.The results indicate that compounds (2-16) have higher reactivity than the reference drug D, as evidenced by their higher electronegativity (χ), chemical potential (μ), global hardness (η), and electrophilicity index (ω), as well as their lower softness (σ).The electrophilicity index 65 is a commonly used parameter for predicting biological activity and identifying reactive sites by measuring the energy lowering resulting from electron transfer between the HOMO and LUMO.Meanwhile, the lower electrophilicity index of compound 8 (3.428 eV) and reference drug D (1.939 eV) provides a basis for further analysis of its potential biological activity through molecular docking with a suitable protein.The higher dipole moment of compounds (2-16) than the reference drug suggested that they may have a better binding affinity.These findings make our compounds promising candidates for further investigation of their biological activity through molecular docking studies.
The DFT results revealed that compound 4 exhibited a strong binding affinity for insulin-degrading enzyme (IDE) with a binding energy of − 8.5 kcal mol −1 .Moreover, all docking values obtained for the ligands were superior to the reference drug dapagliflozin (IDE: − 7.9 kcal mol −1 ), indicating the potential of the ligands for diabetes treatment.Compound 4 also possesses a smaller HOMO-LUMO gap (4.585 eV) compared to Dapagliflozin (5.328 eV).A smaller HOMO-LUMO gap can be associated with greater ease of electron transfer, potentially contributing to stronger binding interactions with the target protein.For instance, compound 4 demonstrated a Homo-Lumo gap of 4.585, electronegativity (χ) of 4.128, and electrophilicity (ω) of 3.716, which are indicative of its favorable molecular characteristics.These properties align well with its strong binding affinity observed in the DFT calculations.Similarly, while compounds like Vitexin and the parent compound exhibited slightly lower binding affinities compared to compound 4, their respective molecular properties, such as Homo-Lumo gap and electronegativity, might provide insights into their interaction mechanisms with the receptor.
Furthermore, comparing the molecular properties of compound 4 with other compounds, such as myricetin and dapagliflozin, offers additional insights.Myricetin, with a binding affinity similar to compound 4, demonstrated comparable molecular characteristics, suggesting potential similarities in their interaction mechanisms with the receptor.Conversely, dapagliflozin, despite its lower binding affinity, exhibited distinctive molecular properties that may contribute to its interaction behavior with IDE.
By correlating the DFT results with the docking outcomes, we aim to provide a more holistic understanding of the ligand-receptor interactions and their underlying molecular mechanisms.This integration will enhance the interpretation of our findings and guide future studies aimed at optimizing ligand design for diabetes treatment.
NBO analysis can provide insights into the bonding and stability of molecules by examining the interactions between atoms and the distribution of atomic charges.The stabilization energies of the studied molecules are affected by intramolecular charge transfer, as shown in Figs.S2 to S19 in the SI.MEP maps in Fig. 2 (Figs.S2 to S19 in the SI) visually represent the electrostatic potential of the molecules and may be used to define areas of electrophilicity and nucleophilicity.The colors used in the MEP maps (blue for electrophilic regions and red for nucleophilic regions) indicate the strength of the electrical potential.
Electron-donating groups (EDGs) and electron-withdrawing groups (EWGs) can significantly influence the biological properties of a compound, particularly for small molecules that interact with biological targets like enzymes or receptors.EDGs tend to increase the electron density around a specific atom, making it more willing to accept protons (H +) and enhance the basicity of the compound, potentially affecting its interaction with acidic residues in biological targets.Depending on the functional group, EDGs can improve the water solubility of a compound by creating hydrophilic interactions with water molecules.This can be crucial for drug delivery and absorption in the body.In some cases, EDGs can hinder the metabolism of a compound by enzymes, potentially extending its duration of action in the body.EWGs pull electron density away from a specific atom, making it more acidic (prone to releasing protons).So that EWGs can influence the interaction with basic residues in biological targets.Depending on the functional group, EWGs can sometimes enhance the ability of a compound to pass through cell membranes, which is important for drug delivery 57,[72][73][74] .
The compound 4 has isopropyl group can slightly increase the electron density around the chromone core, potentially influencing its interaction with biological targets.In some antibiotics, the presence of an EDG like an amine group can enhance its hydrogen bonding with bacterial targets, leading to stronger binding and   www.nature.com/scientificreports/antibacterial activity 75,76 .The position of the EDG or EWG relative to the functional group involved in biological activity can significantly impact the effect.The overall balance of EDGs and EWGs within a molecule determines the net effect on its biological properties.Understanding these effects is crucial for medicinal chemistry, where researchers aim to design drugs with optimal potency, selectivity, and pharmacokinetic properties.

Analysis of physicochemical and pharmacokinetic properties
To determine compliance with Lipinski and Veber's criteria, it is essential to evaluate the physicochemical characteristics of 3-formyl chromone derivatives (1-16).Before a chemical can be administered orally, it must meet the five criteria as per Lipinski's recommendations: (a) molecular weight (MW) of 500 g/mol or less; (b) octanol-water partition coefficient (log P ≤ 5); (c) no more than five H-bond donors (HBD); no more than ten H-bond acceptors (HBA); and (e) no more than 140 Å 2 of topological polar surface area (TPSA) 66,67 .Additionally, Veber has proposed two more requirements for drug bioavailability: (a) TPSA must be less than or equal to 140 Å 2 (as per Lipinski's standards), and (b) the number of rotatable bonds (nrotb) in the molecules should be less than 10.The compounds (1-16) with the most promising biological activity were analyzed using SwissADME to assess their compliance with the Lipinski and Veber criteria.All compounds (1-16) were found to meet the prescribed limit ranges specified by the Lipinski and Veber criteria (Table 3).Additionally, there was a strong agreement (less than 6) for bioactive compounds regarding molecular weight (< 500 g/mol), MLOGP (4.15), and Log S (ESOL) within the specified limit ranges.Moreover, all compounds (1-16) that obtained a drug-like (bioavailability) score of 1 demonstrated their adherence to the criteria, providing robust theoretical support for developing innovative novel drugs.The drug-likeness parameters calculated using molinspiration for all 16 compounds are presented in Table 4, which includes their potential as G protein-coupled receptor (GPCR) ligands, ion channel modulators (ICM), kinase inhibitors (KI), nuclear receptor ligands (NRL), protease inhibitors (PI), and enzyme inhibitors (EI).Among these, compound 4 showed a GPCR value of − 0.66, and compared to 0.15 for the reference drug D (dapagliflozin).
In drug development, it is crucial to assess the pharmacokinetic properties, such as absorption, distribution, metabolism, excretion, and toxicity (ADMET), to ensure innovative drugs' efficient and economical creation.This study utilized SwissADME (http:// www.swiss adme.ch/ index.php) and admetSAR (http:// lmmd.ecust.edu.cn/ admet sar2/) software to evaluate the ADMET properties of all 16 compounds (1-16).The evaluation involved seven essential ADMET characteristics, as listed in Table 5, including cytochrome P450 enzymes (CYP3A4 and CYP2C19) inhibition, hERG inhibition, plasma protein binding (PPB), blood-brain barrier (BBB) penetration, human intestinal absorption (HIA), and synthetic accessibility (SA) score.It should be noted that drugs that affect the central nervous system (CNS) should have good blood-brain barrier penetration, whereas those that do should not penetrate the BBB.Low absorption is defined as < 0.1, medium absorption as 0.1-2, and high BBB Table 5.In silico prediction of selected ADMET parameters for the 3-formyl chromone derivatives (1-16).a HIA: Human Intestinal Absorption (%); BBB: Blood-Brain Barrier penetration; PPB: plasma protein binding; CYP3A4: Cytochrome P4503A4; CYP2C19: Cytochrome P4502C19; hERG: human ether-a-go-go-related gene, hERG inhibition potential (pIC 50 ), the potential risk for inhibitors ranges 5.5-6.b The values are using admetSAR.c The values are using swissADME..The results indicate that all compounds (1-16) meet the ADMET standards for drug-likeness (bioavailability), promising for developing novel drugs.
Based on our study, most compounds showed BBB penetration values between − 0.5 and − 0.7, similar to vitexin (− 0.70) and myricetin (− 0.77).The range of pIC50 predictions for potential risk from hERG activity inhibitors is between − 0.4 and − 0.82 68 .Our research showed that compound 4 had the lowest hERG pIC50 value of − 0.4744, while all other compounds had values below the reference range.To assess the synthetic accessibility (SA) score of the drug-like compounds (1-16), which ranges from 1 (very simple) to 10 (very difficult), we employed a unique method 64 .The SA scores of all the compounds (1-16) ranged from 2.4 to 2.6, which was lower than the reference drug, with a high SA score of 4.52.Vitexin and myricetin have higher SA scores than compounds (1-16).Our findings suggest that all compounds (1-16) meet the ADMET standards, indicating drug-likeness (bioavailability) and potential for developing novel medications.

Analysis of pharmacological activities
For a comprehensive investigation of the potential pharmacological effects of the 3-formyl chromone derivatives (1-16), Multilevel Neighborhoods of Atoms (MNA) descriptors were applied using the Prediction of Activity Spectra for Substances (PASS) method.The use of MNA descriptors allowed for a unique and detailed characterization of the chemical structures, which aided in clarifying the compounds' potential biological functions.The PASS method can simultaneously predict various biological activities, such as mutagenicity, carcinogenicity, teratogenicity, embryotoxicity, and primary and side pharmacological effects.A compound's biological activity is influenced by its structural and physicochemical characteristics, the biological entity (such as species, gender, age, etc.), and the treatment approach (such as dose, route of administration, etc.).The MNA descriptors are used by PASS to determine the probable activity (Pa) and probable inactivity (Pi) for the anticipated activity spectrum of a drug.These probabilities range from 0.000 to 1.000, where Pa + Pi ≠ 1. PASS predictions can be interpreted in several ways.For example, if Pa > 0.7, there is a high likelihood of finding the activity experimentally.If Pa is between 0.5 and 0.7, the chemical is likely to demonstrate the activity in an experimental setting, but it is likely different from recognized pharmaceutical drugs.The likelihood of finding the activity experimentally is lower if Pa < 0.5.
3-formyl chromone derivatives (1-16), except some of them exhibited Pa values greater than 0.5, as presented in Table 6.The histidine kinase inhibitor values ranged from Pa = 0.614 to Pa = 0.830, nearly identical to those of the vitexin (Pa = 0.819) and myricetin (Pa = 0.892).But the reference drug, dapagliflozin, has a lower Pa (0.439) value as a histidine kinase inhibitor.Compound 4 had a Pa value of 0.669 and a Pi value of 0.010 as a histidine kinase inhibitor, indicating that it was good in promoting antibacterial activity by targeting bacterial histidine kinase, although more potent than dapagliflozin.Dysregulated kinase activity is frequently associated with multiple disorders, including cancer.Kinases are enzymes that are essential for many biological functions, including cell proliferation, differentiation, and signaling.Moreover, all compounds, except for compounds 9 and www.nature.com/scientificreports/13, showed insulysin inhibitor values between Pa = 0.515 to Pa = 0.729, indicating that these compounds could potentially be therapeutic agents for diabetes treatment.Using a clearance mechanism anchored on degradation, IDE controls the amount of circulating insulin in several organs 69,70 .The reference drug shows no activity as an insulysin inhibitor.HIF1α expression is frequently increased in cancer cells and is linked to metastasis and tumor formation.Compound 4 was found to have a Pa value of 0.738 and a Pi value of 0.016 as a HIF1α expression inhibitor, indicating moderate potency in inhibiting the expression of HIF1α.Additionally, some compounds showed stronger activity as alcohol dehydrogenase inhibitors than the reference drugs.Our findings revealed that the Pa values of the studied compounds, acting as both insulin inhibitors (antidiabetic) and alcohol dehydrogenase inhibitors, ranged from 0.5 to 0.7, with only a few exceptions.This indicates a strong potential for these compounds to inhibit both insulin and alcohol dehydrogenase enzymes effectively.Additionally, our compounds exhibited apoptosis agonist activity, which is well-known for inducing cancer cell death 71 .
The findings of the molecular docking study indicated that compounds 1-16 exhibited a higher binding affinity for IDE, BHK, and COX, while their binding affinity for human CAD, p53, Mpro, and HIF-α was relatively lower.These results suggest that these compounds have the potential to be used as therapeutic agents for diseases related to the dysregulation of IDE, BHK, and COX.While molecular docking is a valuable tool for drug discovery, it is crucial to consider a compound's pharmacokinetics, toxicity, and metabolism, in addition to its binding affinity.In vitro and in vivo research is necessary to comprehend how compounds behave in living organisms and evaluate their safety and efficacy.By employing a combination of computational and experimental techniques, researchers can optimize the compounds and create dependable and effective therapeutic drugs.D (Reference drug: dapagliflozin).Furthermore, Figs. 3 and 4 present 2D diagrams for compound 4 in IDE and COX, depicting the ligand's interactions with the protein and how it affects the pathogens' active components.The bond lengths and residue numbers are indicated in Fig. 3d and Fig. 4d; the ligand-protein interaction of compound 4 through the amino acid residues of IDE is demonstrated, revealing around eight different bonds.Four of these bonds are hydrophobic, including Pi-Pi stacking on residue TYR A: 269, Pi-alkyl on residue LEU A: 156, and alkyl on residue ALA A: 434.Three conventional hydrogen bonds are at residues THR A: 271, ARG A: 432, and THR A: 163.The last bond is Pi-anion on residue GLU A: 160.The protein-ligand interaction of compound 4 in COX, shown in Fig. 4, exhibits five different interactions.Four of them are hydrogen bonds, including a conventional hydrogen bond at residue HIS A: 207, a carbon-hydrogen bond with THR A: 206, and a Pi-donor hydrogen bond with HIS A: 388.One bond is hydrophobic, including amide-Pi stacking with ALA A: 202.
Based on the results of molecular docking and interaction analysis, it has been concluded that compound 4 exhibits a stronger binding affinity towards the IDE protein.Additionally, all compounds also show good  S2).Thus, the in silico study suggests that compound 4 has promising potential as an insulin-degrading enzyme protein inhibitor.The selection of Insulin Degrading Enzyme (IDE) as the target protein for our in-silico study of potential anti-diabetic agents is well-supported by existing research.IDE plays a critical role in insulin degradation, and its inhibition has been demonstrated as a promising strategy for improving blood sugar control in diabetic patients.This is further evidenced by the existence of previous in vivo and in vitro studies exploring the therapeutic potential of IDE inhibitors 17 .Our study aimed to leverage computational methods to identify novel candidate molecules that could target IDE and potentially offer new avenues for developing anti-diabetic drugs.

In silico molecular dynamics
The goal of the MD simulations was to examine the stability and interactions of compound 4 with the IDE and COX proteins for a duration of 20 ns.The molecular docking analysis had previously shown that compound 4 had a strong binding affinity of − 8.5 kcal mol −1 with the IDE protein and a favorable binding affinity of − 8.1 kcal mol −1 with COX, indicating its potential as a drug candidate.The MD simulation results were evaluated using RMSD (Root Mean Square Deviation), RMSF (Root Mean Square Fluctuation), and Rg values, potential energies, temperature, and hydrogen bonding to gain a comprehensive understanding of the system's behavior over time.
The RMSD values derived from the MD simulations offer insights into ligand-protein complexes' stability and conformational changes, as presented in Fig. 5.For the ligand-IDE protein complex (green curve), the RMSD values were in the range of 0.2 to 0.5 nm, while for the ligand-COX protein complex (blue curve), the values were between 0.1 and 0.2 nm, indicating that both complexes are stable and exhibit minimal deviation from their initial positions during the simulation.However, the RMSD values for water and ions in the ligand-IDE protein complex (yellow curve) were approximately 7.7 nm, while those in the ligand-COX protein complex (brown curve) were about 7.3 nm.These observations suggest potential differences in the stability and dynamics of the   solvent molecules in the two complexes, which could be due to the size and shape of the binding sites, as well as the specificity and strength of ligand-protein interactions.It is important to note that the RMSD values for water and ions can be affected by various simulation parameters, including force field, simulation time, and binding site definition.Therefore, further analyses, such as solvent density profiles, hydrogen bonding patterns, and residence times, are recommended to elucidate the underlying factors contributing to the differences in RMSD values.
The RMSF is an important measure of the flexibility and mobility of protein-ligand complexes in molecular dynamics simulations.In this study, separate RMSF values were calculated for the protein and ligand in the IDE (6BF8) and COX (6Y3C) protein-ligand complexes with compound 4, and the results are shown in Fig. 6.A lower RMSF value indicates greater rigidity of a particular residue or atom, whereas a higher RMSF value indicates greater flexibility or mobility.The COX protein-ligand complex had an RMSF value of 0.1 nm (red line), indicating that it is relatively rigidly held in place, while the IDE protein-ligand complex had an RMSF value of 0.3 nm (black line), indicating that it is more flexible.A fluctuation observed in the IDE protein-ligand complex at the 15,000 range amino acid atom with a value of around 1.5 nm could be due to a number of factors, such as the inherent flexibility of the amino acid or its interaction with the ligand or solvent molecules.Additional analysis, such as examining the specific interactions of this residue with other parts of the protein or ligand, may provide further insights into the cause of this fluctuation.The differences in RMSF values between the COX and IDE protein-ligand complexes suggest that they have varying degrees of flexibility or mobility, which may have implications for their biological function.The fluctuation observed in the IDE protein-ligand complex at atom 15,000 highlights the importance of careful analysis of simulation results and emphasizes the potential complexity of protein-ligand dynamics.
The radius of gyration (Rg) is a measure of molecular compactness and is often used to monitor conformational changes in MD simulations.In this study, the Rg values for compound 4 bound to the IDE and COX proteins, were compared over a 20 ns simulation period, as shown in Fig. 7.It was observed that the Rg values for the IDE (protein-ligand) complex exhibited slight fluctuations within the range of 3.4-3.5 nm, while the Rg value for the COX complex remained consistently at 3.1 nm throughout the simulation period.This suggests that the COX complex is more stable and less prone to conformational changes compared to the IDE complex.Additionally, the Rg value for the ligand alone was found to be 0.3 nm in both cases, indicating that the ligand assumes a more compact conformation when bound to the proteins.Furthermore, the Rg value for the water-ion was larger than that of the protein-ligand complex, indicating a more loosely packed structure for water-ions compared to the ligand-protein complex.Specifically, the Rg values for the water-ion were 5.72 nm (red line) and 5.485 nm (black line) in the COX and IDE protein-ligand complexes, respectively, over the simulation period.Understanding the behavior of protein-ligand complexes can be valuable for the rational design of drugs with improved binding affinity and specificity.
Hydrogen bonds play a crucial role in protein-ligand interactions, providing valuable information about binding strength and specificity.In this study, MD simulations were used to investigate the number of hydrogen bonds formed between compound 4 and the active site of two proteins, IDE and COX, as shown in Fig. 8.
The number of hydrogen bonds between the ligand and the active site of the IDE protein fluctuated between 0 and 2 during the 20 ns simulation period (Fig. 8a).This suggests that hydrogen bond formation between the ligand and the IDE protein needs to be consistently maintained and is influenced by many factors.Consequently, the IDE protein and ligand interaction is dynamic rather than static.On the other hand, the number of hydrogen bonds formed between the ligand and the active site of the COX protein ranged from 0 to 1 throughout the 20 ns simulation (Fig. 8b).This indicates that the hydrogen bond formation between the COX protein and the ligand is relatively stable and consistent, implying a more static and less dynamic interaction compared to the IDE protein-ligand interaction.
These findings demonstrate that multiple factors, including conformational changes, ligand movement, and the specific characteristics of the protein, influence the number of hydrogen bonds formed between a protein and a ligand.The observed dynamic nature of hydrogen bond formation in the IDE protein-ligand interaction holds significant implications for drug design and optimization.It emphasizes the importance of considering the dynamic aspects of protein-ligand interactions in rational drug design processes.
In MD simulations, monitoring the temperature of the system is crucial to ensure its stability.In this study, the IDE and COX proteins' temperature was relatively stable, fluctuating between 298 and 302 Kelvin during the 20,000 ps simulation period (Fig. 9a,c).This indicates that the simulation was well-controlled and the system remained within an appropriate temperature range.However, the system's potential energy, which reflects the interactions between atoms, showed fluctuations throughout the simulation period.For the IDE protein, the potential energy fluctuated between − 1.680e +06 and − 1.677e +06 kJ mol −1 (Fig. 9b), while for the COX protein, it varied between − 1.325e +06 and − 1.323e +06 kJ mol −1 (Fig. 9d).These fluctuations imply that the interactions between the atoms in the system are dynamic and continuously changing.In summary, the stable temperature observed in the study indicates a well-controlled simulation, but the fluctuation in potential energy highlights the dynamic nature of protein-ligand interactions.

PCA analysis
Conformational principal component analysis (PCA) was performed on the simulated molecular dynamics (MD) trajectories of the IDE protein-ligand complex with compound 4 at 300 K.The aim was to identify the variability, collective motions, and changes in protein conformations observed in subsets of the primary components throughout the MD simulations.The PCA analysis was conducted using the Bio3D program [55][56][57] .The resulting eigenvalues versus eigenvector plots are displayed in Fig. 10.The first three eigenvectors, namely PC1, PC2, and PC3, were utilized to compare the dominant motions within the smaller trajectory subgroup.Colored dots represent the captured variance by the eigenvectors.Regarding the internal motions observed in the MD trajectory, the protein-ligand complex simulation at 300 K exhibited the most extensive variability in PC1, accounting for 26.13% of the total variance.PC2 showed a lower percentage of variation (16.3%), and PC3 showed 15.48% of the conflict.These three components accounted for 57.9% of the total variance (Fig. 10).
Furthermore, the simulation appeared to have converged, as evidenced by the cosine content value of the eigenvectors, which was computed to be 0.32 based on the MD trajectory.The conformational PCA analysis provided insights into the dominant motions and variability within the IDE protein-ligand complex during the MD simulation at 300 K.These findings contribute to a better understanding of the dynamic behavior and conformational changes occurring in the protein-ligand complex.

Conclusion
This study employed computational methods to virtually screen a library of 6-substituted 3-formyl chromone derivatives (compounds 1-16) for potential therapeutic applications.The results revealed promising anti-diabetic activity, particularly for compound 4, demonstrating a strong binding affinity to a crucial enzyme involved in diabetes management (IDE).Notably, compound 4's docking score to IDE (-8.5 kcal mol -1 ) was comparable to the established medication dapagliflozin (-7.9 kcal mol -1 ).Additionally, all compounds adhered to Lipinski and Veber's rules, indicating good oral bioavailability and potentially lower toxicity.These findings suggest that compound 4 represents a significant step forward in the discovery of novel anti-diabetic therapies.The selection of Insulin Degrading Enzyme (IDE) as the target protein for our in-silico study of potential anti-diabetic agents is well-supported by existing research.IDE plays a critical role in insulin degradation, and its inhibition has been demonstrated as a promising strategy for improving blood sugar control in diabetic patients.This is further evidenced by the existence of previous in vivo and in vitro studies exploring the therapeutic potential of IDE inhibitors.Future investigations will involve in vitro and in vivo experiments to validate the computational predictions and elucidate the compound's mechanism of action.This research opens new avenues for developing more effective and potentially less toxic treatments for diabetes.

Figure 5 .
Figure 5. RMSD evolution for merge docked protein-ligand complex between two proteins and modeled ligand 4, (IDE (6BF8), green line and COX (6Y3C), (4, red and black), water and ions in the ligand-IDE protein complex (yellow) and water and ions in the ligand-COX protein complex (brown) during 20 ns MD simulation.

Figure 6 .Figure 7 .
Figure 6.RMSF evolution for the IDE protein-ligand complex (black) and the COX protein-ligand complex (red) and modeled ligand 4, during 20 ns MD simulation.

Figure 8 .
Figure 8. Plots of the number of hydrogen bond versus time (ps) for (a) between protein complex of PDB: 6BF8 and compound 4 and (b) between protein complex of PDB: 6Y3C and compound 4 during 20 ns MD simulation.

Figure 9 .
Figure 9.The temperature and potential energy curves over the course of the 20 ns MD simulation for the potential energy graph for the protein-ligand complex of IDE (a) and COX (c); the temperature graph of the protein-ligand complex of IDE (b) and COX (d) with compound 4.

Figure 10 .
Figure 10.PCA of the IDE complex with compound 4 at 300 K MD trajectory, where red dots represent stable conformations and black dots indicate energetically unstable conformations.